Limits of Preprocessing* 



Stefan Szeider 

Vienna University of Technology, Vienna, Austria 
Stefan @ szeider.net 

Abstract 

We present a first theoretical analysis of the power of polynomial-time preprocessing for important 
combinatorial problems from various areas in AI. We consider problems from Constraint Satisfaction, 
Global Constraints, Satisfiability, Nonmonotonic and Bayesian Reasoning. We show that, subject to a 
complexity theoretic assumption, none of the considered problems can be reduced by polynomial-time 
preprocessing to a problem kernel whose size is polynomial in a structural problem parameter of the 
input, such as induced width or backdoor size. Our results provide a firm theoretical boundary for the 
performance of polynomial-time preprocessing algorithms for the considered problems. 

Keywords: Fixed-Parameter Tractability, Constraint Satisfaction, Global Constraints, Satisfiability, 
Bayesian Networks, Normal Logic Programs, Computational Complexity. 

1 Introduction 

Many important computational problems that arise in various areas of AI are intractable. Nevertheless, 
AI research was very successful in developing and implementing heuristic solvers that work well on real- 
world instances. An important component of virtually every solver is a powerful polynomial-time prepro- 
cessing procedure that reduces the problem input. For instance, preprocessing techniques for the proposi- 
tional satisfiability problem are based on Boolean Constraint Propagation (see, e.g., Een and Biere, 2005), 
CSP solvers make use of various local consistency algorithms that filter the domains of variables (see, 
e.g., Bessiere, 2006); similar preprocessing methods are used by solvers for Nonmonotonic and Bayesian 
reasoning problems (see, e.g., Gebser et al., 2008, Bolt and van der Gaag, 2006, respectively). 

Until recently, no provable performance guarantees for polynomial-time preprocessing methods have 
been obtained, and so preprocessing was only subject of empirical studies. A possible reason for the lack 
of theoretical results is a certain inadequacy of the P vs NP framework for such an analysis: if we could 
reduce in polynomial time an instance of an NP-hard problem just by one bit, then we can solve the entire 
problem in polynomial time by repeating the reduction step a polynomial number of times, and P = NP 
follows. 

With the advent of parameterized complexity (Downey, Fellows, and Stege, 1999), a new theoretical 
framework became available that provides suitable tools to analyze the power of preprocessing. Param- 
eterized complexity considers a problem in a two-dimensional setting, where in addition to the input 
size n, a problem parameter k is taken into consideration. This parameter can encode a structural aspect 
of the problem instance. A problem is called fixed-parameter tractable (FPT) if it can be solved in time 
f{k)p{n) where / is a function of the parameter k and p is a polynomial of the input size n. Thus, for 
FPT problems, the combinatorial explosion can be confined to the parameter and is independent of the 
input size. It is known that a problem is fixed-parameter tractable if and only if every problem input can 
be reduced by polynomial-time preprocessing to an equivalent input whose size is bounded by a function 
of the parameter (Downey, Fellows, and Stege, 1999). The reduced instance is called the problem ker- 
nel, the preprocessing is called kemelization. The power of polynomial-time preprocessing can now be 
benchmarked in terms of the size of the kernel. Once a small kemel is obtained, we can apply any method 
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of choice to solve the kernel: brute-force search, heuristics, approximation, etc. (Guo and Niedermeier, 
2007). Because of this flexibility a small kernel is generaUy preferable to a less flexible branching-based 
fixed-parameter algorithm. Thus, small kemels provide an additional value that goes beyond bare fixed- 
parameter tractabihty. 

In general the size of the kernel is exponential in the parameter, but many important NP-hard optimiza- 
tion problems such as Minimum Vertex Cover, parameterized by solution size, admit polynomial kemels, 
see, e.g., (Bodlaender et al., 2009) for references. 

In previous research several NP-hard AI problems have been shown to be fixed-parameter tractable. 
We list some important examples from various areas: 

• Constraint satisfaction problems (CSP) over a fixed universe of values, parameterized by the induced 
width (Gottlob, Scarcello, and Sideri, 2002). 

• Consistency and generalized arc consistency for intractable global constraints, parameterized by the 
cardinalities of certain sets of values (Bessiere et al., 2008). 

• Prepositional satisfiability (SAT), parameterized by the size of backdoors (Nishimura, Ragde, and 

Szeider, 2004). 

• Positive inference in Bayesian networks with variables of bounded domain size, parameterized by 
size of loop cutsets (Pearl, 1988; Bidyuk and Dechter, 2007). 

• Normionotonic reasoning with normal logic programs, parameterized by feedback width (Gottlob, 
Scarcello, and Sideri, 2002). 

However, only exponential kemels are known for these fundamental AI problems. Can we hope for 
polynomial kernels? 

Results Our results are throughout negative. We provide strong theoretical evidence that none of the 
above fixed-parameter tractable AI problems admits a polynomial kernel. More specifically, we show that 
a polynomial kernel for any of these problems causes a collapse of the Polynomial Hierarchy to its third 
level, which is considered highly unlikely by complexity theorists. 

Our results are general: The kernel lower bounds are not hmited to a particular preprocessing tech- 
nique but apply to any clever technique that could be conceived in future research. Hence the results 
contribute to the foundations of Al. 

Our results suggest the investigation of altemative approaches to polynomial-time preprocessing; for 
instance, preprocessing that produces in polynomial time a Boolean combination of polynomially sized 
kemels instead of one single kernel. 

2 Formal Background 

A parameterized problem P is a subset of E* x N for some finite alphabet S. For a problem instance 
[x, fc) e S* X N we caU x the main part and k the parameter. We assume the parameter is represented in 
unary. For the parameterized problems considered in this paper, the parameter is a function of the main 
part, i.e., k — n{x) for a function tt. We then denote the problem as P(7r), e.g., C/-CSP(width) denotes 
the problem [/-CSP parameterized by the width of the given tree decomposition. 

A parameterized problem P is fixed-parameter tractable if there exists an algorithm that solves any 
input (x, fc) G S* X N in time 0{f{k) ■ p{\x\) where / is an arbitrary computable function of k and p is 
a polynomial in n. 

A kemelization for a parameterized problem P C S* x N is an algorithm that, given (.x, fc) e S* x N, 
outputs in time polynomial in |a;| + fc a pair (.r'. A;') G S* x N such that (i) (.t, k) G P if and only if 
{x' , k') e P and (ii) \x'\ -{- k' < g{k), where g is an arbitrary computable function, called the size of the 
kemel. In particular, for constant k the kernel has constant size g{k). If is a polynomial then we say that 
P admits a polynomial kemel. 
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Every fixed-parameter tractable problem admits a kernel. This can be seen by the following argument 
due to Downey, Fellows, and Stege (1999). Assume we can decide instances {x, k) of problem P in time 
f{k)\n\'-'^^\ We kemelize an instance (x, k) as follows. If |a;| < f{k) then we already have a kernel of 
size f{k). Otherwise, if \x\ > f{k), then /(A;)|a;|'^(^) < |a;|'^'^^) is a polynomial; hence we can decide 
the instance in polynomial time and replace it with a small decision-equivalent instance {x', k'). Thus 
we always have a kernel of size at most /(A;). However, f{k) is super-polynomial for NP-hard problems 
(unless P = NP), hence this generic construction is not providing polynomial kernels. 

We understand preprocessing for an NP-hard problem as a polynomial-time procedure that transforms 
an instance of the problem to a (possible smaller) solution-equivalent instance of the same problem. Ker- 
nelization is such a preprocessing with a performance guarantee, i.e., we are guaranteed that the prepro- 
cessing yields a kernel whose size is bounded in terms of the parameter of the given problem instance. In 
the literature also different forms of preprocessing have been considered. An important one is knowledge 
compilation, a two-phases approach to reasoning problems where in a first phase a given knowledge base 
is (possibly in exponential time) preprocessed ("compiled"), such that in a second phase various queries 
can be answered in polynomial time (Cadoli et al., 2002). 

3 Tools for Kernel Lower Bounds 

In the sequel we will use recently developed tools to obtain kernel lower bounds. Our kernel lower bounds 
are subject to the widely believed complexity theoretic assumption NP % co-NP/poly (or equivalently, 
PH ^ Sp). In other words, the tools allow us to show that a parameterized problem does not admit a poly- 
nomial kernel unless the Polynomial Hierarchy collapses to its third level (see, e.g., Papadimitriou, 1994). 

A composition algorithm for a parameterized problem P C S* x N is an algorithm that receives as 
input a sequence {xi, k), . . . , {xt, A:) e S* x N, uses time polynomial in X]i=i + ^^id outputs 
{y, k') e S* X N with (i) (y, k') e P if and only if {xi, k) G P for some 1 < i < t, and (ii) k' is 
polynomial in k. A parameterized problem is compositional if it has a composition algorithm. With each 
parameterized problem PCS* x N we associate a classical problem 

UP[P] = {x#l'= ■.{x,k)&P} 

where 1 denotes an arbitrary symbol from E and # is a new symbol not in S. We call UP[P] the unpa- 
rameterized version of P. 

The following result is the basis for our kernel lower bounds. 

Theorem 1 (Bodlaender et al., 2009, Fortnow and Santhanam, 2008). Let V he a parameterized problem 
whose unparatiieterized version is N¥-complete. IfP is compositional, then it does not admit a polynomial 
kernel unless NP C co-NP//5o/>', i.e., the Polynomial Hierarchy collapses. 

Let P, Q C E* X N be parameterized problems. We say that P is polynomial parameter reducible 
to Q if there exists a polynomial time computable function X : S* x N ^ S* x N and a polynomial 
p, such that for all (x, A:) G E* x N we have (i) (x, A:) G P if and only if K{x, k) = {x', k ) G Q, and 
(ii) k' < p{k). The function K is called a polynomial parameter transformation. 

The following theorem allows us to transform kernel lower boimds from one problem to another. 

Theorem 2 (Bodlaender, Thomasse, and Yeo, 2009). Let P and Q be parameterized problems such that 
UP[P] is NF-complete, UP[Q] is in NP, and there is a polynomial parameter transformation from P to Q. 
IfQ has a polynomial kernel, then P has a polynomial kernel. 

4 Constraint Networks 

Constraint networks have proven successful in modeling everyday cognitive tasks such as vision, language 
comprehension, default reasoning, and abduction, as well as in applications such as scheduling, design, 
diagnosis, and temporal and spatial reasoning (Dechter, 2010). A constraint network is a triple / = 
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(y, U, C) where V is a finite set of variables, ?7 is a finite universe of values, and C = {Ci, . . . , Cm} is 
set of constraints. Each constraint Ci is a pair {Si, Ri) where 5*^ is a list of variables of length rj called 
the constraint scope, and Rt is an rj-ary relation over U, called the constraint relation. The tuples of Ri 
indicate the allowed combinations of simultaneous values for the variables Si. A solution is a mapping 
T : y — )• f7 such that for each 1 <i <m and Si = [xi, . . . , XrJ, we have (t(xi), . . . , T{xr-)) G i?,. A 
constraint network is satisfiahle if it has a solution. 

With a constraint network / = {V,U,C) we associate its constraint graph G = {V,E) where E 
contains an edge between two variables if and only if they occur together in the scope of a constraint. 
A width w tree decomposition of a graph G is a pair (T, A) where T is a tree and A is a labeling of the 
nodes of T with sets of vertices of G such that the following properties are satisfied: (i) every vertex of G 
belongs to A(p) for some node p of T; (ii) every edge of G is is contained in X{p) for some node p of T; 

(iii) For each vertex z; of G the set of all tree nodes p with v e X{p) induces a connected subtree of T; 

(iv) |A(p)| — 1 < w holds for aU tree nodes p. The treewidth of G is the smallest w such that G has a 
width w tree decomposition. The induced width of a constraint network is the treewidth of its constraint 
graph (Dechter and Pearl, 1989). We note in passing that the problem of finding a tree decomposition of 
width w is NP-hard but fixed-parameter tractable in w. 

Let U h&dL fixed universe containing at least two elements. We consider the following parameterized 
version of the constraint satisfaction problem (CSP). 

?7-CSP(width) 

Instance: A constraint network J = (y,U,C) and a width w tree decomposition of the constraint 
graph of /. 

Parameter: The integer w. 

Question: Is / satisfiable? 

It is well known that U -CSP(width) is fixed-parameter tractable over any fixed universe U (Dechter and 
Pearl, 1989; Gottlob, Scarcello, and Sideri, 2002) (for generalizations see Samer and Szeider, 2010). We 
contrast this classical result and show that it is unUkely that f7-CSP(width) admits a polynomial kernel, 
even in the simplest case where U = {0, 1}. 

Theorem 3. {0, l}-CSP(width) does not admit a polynomial kernel unless the Polynomial Hierarchy 
collapses. 

Proof. We show that {0, l}-CSP(width) is compositional. Let {Ii,Ti), 1 < i < be a given sequence 
of instances of {0, l}-CSP(width) where /; = [Vi, Ui,Ci) is a constraint network and is a width 
w tree decomposition of the constraint graph of We may assume, w.l.o.g., that Vi H Vj = for 
^ < i < j < t (otherwise we can simply change the names of variables). We form a new constraint 
network I — {V, {0, 1}, C) as follows. We put V = Ui=i ^ ^ {'^i' ■ ■ ■ ,at,bo, ■ ■ . , bt} where Ui, hi are 
new variables. We define the set C of constraints in three groups. 

(1) For each I < i < t and each constraint C = {{xi, . . . , Xr),R) & Ci we add to C a new constraint 
G' = {{xi,..., Xr,ai),R')) where R' { (wi, . . . , u^, 0) : G i? } U {(1, . . . , 1)}. 

(2) We add t ternary constraints G^*, . . . , G; where G* = ((6i-i, 6,, Oj), R*) and R* = {(0, 0, 1), 
(0,1,0), (1,1,1)}. 

(3) Finally, we add two unary constraints C^ = {{bo), (0)) and G^ = {{bt), (1)) which force the 
values of bo and bt to and 1, respectively. 

Let G, Gj be the constraint graphs of I and Ii, respectively. Fig. 1 shows an illustration of Gfort = 4. 
We observe that ai, . . . , a* are cut vertices of G. Removing these vertices separates G into independent 
parts P,G'i, . . . ,G't where P is the path bo,bi, . . . ,bt, and G^ is isomorphic to Gi. By standard techniques 
(see, e.g., Kloks, 1994), we can put the given width w tree decompositions Ti, . . . , of G'^, . . . , G^ and 
the trivial width 1 tree decomposition of P together to a width w + 1 tree decomposition T of G. Clearly 
(J, T) can be obtained from (Jj, Tj), 1 < i < t,m polynomial time. 

We claim that I is satisfiable if and only if at least one of the Ii is satisfiable. This claim can be 
verified by means of the following observations: The constraints in groups (2) and (3) provide that for 
any satisfying assignment there will be some < i < t ~ 1 such that bo, ■ ■ ■ ,bi are all set to and 
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Figure 1: Constraint graph G. 



. ■ . ,bt are all set to 1; consequently is set to and all aj for j i are set to 1. The constraints in 
group (1) provide that if we set to 0, then we obtain from C the original constraint C; if we set to 1 
then we obtain a constraint that can be satisfied by setting all remaining variables to 1. We conclude that 
{0, l}-CSP(width) is compositional. 

In order to apply Theorem 1, it remains to estabhsh that the unparameterized version of 
{0, l}-CSP(width) is NP-complete. Deciding whether a constraint network I over the universe {0, 1} 
is satisfiable is well-known to be NP-complete (say by reducing 3-SAT). To a constraint network / on n 
variables we can always add a trivial width u; = n — 1 tree decomposition of its constraint graph (taking a 
single tree node t where X{t) contains all variables of I). Hence UP[{0, l}-CSP(width)] is NP-complete. 

□ 



5 Satisfiability 

The pwpositional satisfiability problem (SAT) was the first problem shown to be NP-hard (Cook, 1971). 
Despite its hardness, SAT solvers are increasingly leaving their mark as a general-purpose tool in areas 
as diverse as software and hardware verification, automatic test pattern generation, planning, scheduling, 
and even challenging problems from algebra (Gomes et al., 2008). SAT solvers are capable of exploiting 
the hidden structure present in real-world problem instances. The concept of backdoors, introduced by 
Williams, Gomes, and Selman (2003) provides a means for making the vague notion of a hidden structure 
explicit. Backdoors are defined with respect to a "sub-solver" which is a polynomial-time algorithm that 
correctly decides the satisfiability for a class C of CNF formulas. More specifically. Gomes et al. (2008) 
define a sub-solver to be an algorithm A that takes as input a CNF formula F and has the following 
properties: (i) Trichotomy: A either rejects the input F, or determines F correctly as unsatisfiable or 
satisfiable; (ii) Efficiency. A runs in polynomial time; (iii) Trivial Solvability: A can determine if F is 
trivially satisfiable (has no clauses) or trivially unsatisfiable (contains only the empty clause); (iv.) Self- 
Reducibility: if A determines F, then for any variable x and value e S {0, 1}, A determines F[x = e]. 
F[t] denotes the formula obtained from F by applying the partial assignment r, i.e., satisfied clauses are 
removed and false hterals are removed from the remaining clauses. 

We identify a sub-solver A with the class of CNF formulas whose satisfiabihty can be determined 
by A. A strong A-backdoor set (or A-backdoor, for short) of a CNF formula F is a set B of variables 
such that for each possible truth assignment r to the variables in B, the satisfiability of F[t\ can be 
determined by sub-solver A in time 0{n'^). Hence, if we know an ^-backdoor of size k, we can decide 
the satisfiability of F by running A on 2'^ instances F[t], yielding a time bound of 0{2^n''). Hence SAT 
decision is fixed-parameter tractable in the backdoor size k for any sub-solver A. Hence the following 
problem is clearly fixed-parameter tractable for any sub-solver A. 

SAT(^-backdoor) 

Instance: A CNF formula F, and an >l-backdoor B of F of size k. 

Parameter: The integer k. 

Question: Is F satisfiable? 
We are concerned with the question of whether instead of trying aU 2*^ possible partial assigrmients we 
can reduce the instance to a polynomial kernel. We will estabhsh a very general result that apphes to all 
possible sub-solvers. 
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Theorem 4. SAT(yl-backdoor) does not admit a polynomial kernel for any sub-solver A unless the 
Polynomial Hierarchy collapses. 

Proof. We will devise polynomial parameter transformations from the following parameterized problem 
which is known to be compositional (Fortnow and Santhanam, 2008) and therefore unUkely to admit a 
polynomial kernel. 

SAT(vars) 

Instance: A propositional formula F in CNF on n variables. 
Parameter: The number n of variables. 
Question: Is F satisfiable? 

Let F be a CNF formula and V the set of all variables of F. Due to property (ii) of a sub-solver, V is 
an ^-backdoor set for any A. Hence, by mapping [F, n) (as an instance of SAT(vars)) to (F, V) (as an 
instance of SAT(^-backdoor)) provides a (trivial) polynomial parameter transformation from SAT(vars) 
to SAT(^-backdoor). Since the unparameterized versions of both problems are clearly NP-complete, the 

result follows by Theorem 2. □ 

Let 3SAT(7r) (where tt is an arbitrary parameterization) denote the problem SAT(7r) restricted to 
3CNF formula, i.e., to CNF formulas where each clauses contains at most three literals. In contrast 
to SAT(vars), the parameterized problem 3SAT(vars) has a trivial polynomial kernel: if we remove 
duplicate clauses, then any 3CNF formula on n variables contains at most 0{n^) clauses, and so is a 
polynomial kernel. Hence the easy proof of Theorem 4 does not carry over to 3SAT(^-backdoor). We 
therefore consider the cases 3SAT(HORN-backdoor) and 3SAT(2CNF-backdoor) separately, these cases 
are important since the detection of Horn and 2CNF-backdoors is fixed-parameter tractable (Nishimura, 
Ragde, and Szeider, 2004). 

Theorem 5. Neither 3SAT(HORN-backdoor) nor 3SAT(2CNF-backdoor) admit a polynomial kernel 
unless the Polynomial Hierarchy collapses. 

Proof LetC € {H0RN,2CNF}. We show that 3SAT(C -backdoor) is compositional. Let {Fi,B,), 1 < 
i < be a given sequence of instances of 3SAT(C -backdoor) where Fi is a 3CNF formula and i3, is a 
C-backdoor set of Fi of size k. We distinguish two cases. 

Case 1: t > 2*^. Let := "^(-jf^p. \C\ and n := max'^;^ \\Fi\\- Whether Fi is satisfiable or not can 
be decided in time 0{2^n) since the satisfiabihty of a Horn or 2CNF formula can be decided in linear time. 
We can check whether at least one of the formulas Fi, . . . , Ft is satisfiable in time 0(i2'=n) < ©(i^n) 
which is polynomial in f + n. If some Fi is satisfiable, we output (Fj, Bj); otherwise we output {F\,Bi) 
(Fi is unsatisfiable). Hence we have a composition algorithm. 

Case 2: t < 2*^. This case is more involved. We construct a new instance (F, B) of 3SAT(C-backdoor) 
as follows. 

Let s = [log2 1\ . Since t <2^, s <k follows. 

Let Vi denote the set of variables of Fj. We may assume, w.l.o.g., that Bi = ■ ■ ■ = Bt and that 

Vi OVj = Bi for all 1 < j < j < t since otherwise we can change names of variable accordingly. In 
a first step we obtain from every Fj a CNF formula F^' as follows. For each variable x £ Vi \ Bi we 
take two new variables a;o and xi. We replace each positive occurrence of a variable x G Vi\ Bi in Fi 
with the literal xq and each negative occurrence of x with the literal -^Xg. We add all clauses of the form 
{-'Xj-i V Xj) for 1 < j < s; we call these clauses "connection clauses." Let F/ be the formula obtained 
from Fj in this way. We observe that F/ and Fj are SAT-equivalent, since the connection clauses form an 
implication chain. Since the connection clauses are both Horn and 2CNF, Bi is also a C-backdoor of F/. 

We take a set F = {yi, . . . , y^} of new variables. Let Ci, . . . , be the sequence of all 2* possible 
clauses (modulo permutation of Uterals within a clause) containing exactly s Uterals over the variables 
in Y. Consequently we can write Ci as {£\\/ ■ ■ ■ \/ £1) where £l G {?/;, ^yj}. 

For 1 < i < twc add to each connection clause {-'Xj-i V xj) of F^' the literal e Ci. Let F" denote 
the 3 CNF formula obtained from F- this way. 
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For f < i < 2* we define 3CNF formulas F" as follows. If s < 3 then F" consists just of the 
clause Cj. If s > 3 then we take new variables Z2, ■ ■ ■ , zl_2 and let F^' consist of the clauses {£\ V £2 V 
-4), (4 V 4 V -4), . . . , (4_2 V zl_3 V ^zl_2), V 4 V 4_2). Finally, we let F be the 3CNF 
formula containing all the clauses from F[',. . . , F'^s . Any assignment r to F U Bi that satisfies Cj can be 
extended to an assignment that satisfies F^ since such assignment satisfies at least one connection clause 
(.Tj_i V .Xj V t^) and so the chain of implications from from Xq to Xg is broken. 

It is not difficult to verify the following two claims, (i) F is satisfiable if and only if at least one of 
the formulas Fi is satisfiable. (ii) B = F U Bi is a C-backdoor of F. Hence we have also a composition 
algorithm in Case 2, and thus 3SAT(C -backdoor) is compositional. Clearly UP[3SAT(C-backdoor)] is 
NP-complete, hence the result follows from Theorem 1. □ 

6 Global Constraints 

The success of today's constraint solvers relies heavily on efficient algorithms for special purpose global 
constraints (van Hoeve and Katriel, 2006). A global constraint specifies a pattern that frequently occurs 
in real-world problems, for instance, it is often required that variables must all take different values (e.g., 
activities requiring the same resource must all be assigned different times). The AllDifferent global 
constraint efficiently encodes this requirement. 

More formally, a global constraint is defined for a set S of variables, each variable x € S' ranges over 
a finite domain dom{x) of values. An instantiation is an assignment a such that a{x) G dom{x) for each 
X £ S. A global constraint defines which instantiations are legal and which are not. A global constraint 
is consistent if it has at least one legal instantiation, and it is domain consistent (or hyper arc consistent) 
if for each variable x G S and each value d e dom{x) there is a legal instantiation a with a{x) = d. For 
all global constraints considered in this paper, domain consistency can be reduced to a quadratic number 
of consistency checks, hence we will focus on consistency. We assume that the size of a representation of 
a global constraint is polynomial in X^^es I dom{x) \ . 

For several important types T of global constraints, the problem of deciding whether a constraint of 
type T is consistent (in symbols T-Cons) is NP-hard. Examples for such intractable types of constraints 
are NValue, Disjoint, and Uses (Bessiere et al., 2004). An NValue constraint over a set X of 
variables requires from a legal instantiation a that : x e X}\ = N; AllDifferent is the 

special case where A'^ = The global constraints DISJOINT and USES are specified by two sets of 
variables X, Y\ Disjoint requires that a{x) ^ a{y) for each pair x & X and y &Y\ Uses requires 
that for each x € X there is some y G Y such that a{x) = a{y). For a set X of variables we write 
dom,{X) =■ lJa;ex dom{x). 

Bessiere et al. (2008) considered dx = \dom{X)\ as parameter for NValue, dxy = \dom{X) n 
dom{Y)\ as parameter for Disjoint, and dy = \dom{Y)\ as parameter for Uses. They showed that 
consistency checking is fixed-parameter tractable for the constraints under the respective parameteriza- 
tions, i.e., the problems NVALUE-CONS((ia;), DiSJOiNT-CONS(da;y), and USES-CONS(c!y) are fixed- 
parameter tractable. We show that it is unlikely that their results can be improved in terms of polynomial 
kemels. 

Theorem 6. The problems NVALUE-CONS(d.T), DiSJ0iNT-CONS((ia;y), USES-CONS((iy) do not admit 

polynomial kernels unless the Polynomial Hierarchy collapses. 

Proof. We devise a polynomial parameter reduction from SAT(vars). We use a construction of Bessiere 
et al. (2004). Let F = {Ci, . . . , C„} be a CNF formula over variables xi,..., x„. We consider the 
clauses and variables of F as the variables of a global constraint with domains dom{xi) ~ {— z, i}, 
and dom{Cj) = {i : Xi G Cj } U { — i : -iXj G Cj}. Now F can be encoded as an NVALUE 
constraint with X = {x\,. . . , a;„, Ci, . . . , Cm} and N = n (clearly F is satisfiable if and only if the 
constraint is consistent). Since dx = 2n we have a polynomial parameter reduction from SAT(vars) to 
NVALUE-CONS(iia;). Similarly, as observed by Bessiere et al. (2009), F can be encoded as a Disjoint 
constraint with X = {xi, . . . , a;„} and Y = {Ci, . . . , Cm} (dxy < 2n), or as a Uses constraint with 
X = {Ci, . . . , Cm} and Y = {xi, . . . , x„} (dy = 2n). Since the unparameterized problems are clearly 
NP-complete, the result follows by Theorem 2. □ 

Further results on kernels for global constraints hav^been obtained by Gaspers and Szeider (201 1). 
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Bayesian networks (BNs) have emerged as a general representation scheme for uncertain knowledge 
(Pearl, 2010). A BN models a set of stochastic variables, the independencies among these variables, and 
a joint probabiUty distribution over these variables. For simphcity we consider the important special case 
where the stochastic variables are Boolean. The variables and independencies are modelled in the BN by 
a directed acycUc graph G = {V,A), the joint probability distribution is given by a table Ty for each node 
V €V which defines a probability T^\jj for each possible instantiation U = {d\, . . . ,ds) € {true, false}* 
of the parents t'l, . . . , Vg of v in G. The probability Pr(t/) of a complete instantiation U of the variables 
of G is given by the product oiTy\u over all variables v. We consider the problem Positive-BN-Inference 
which takes as input a Boolean BN (G, T) and a variable v, and asks whether Pr(?; = true) > 0. The 
problem is NP-complete (Cooper, 1990) and moves from NP to #P if we ask to compute Pr(f = true) 
(Roth, 1996). The problem can be solved in polynomial time if the BN is singly connected, i.e, if there 
is at most one undirected path between any two variables (Pearl, 1988). It is natural to parametrize the 
problem by the number of variables one must delete in order to make the BN singly connected (the deleted 
variables form a loop cutset). In fact, PosiTlVE-BN-lNFERENCE(loop cutset size) is easily seen to be 
fixed-parameter tractable as we can determine whether Pr(t; = true) > by taking the maximum of 
Pr(i' = true | U) over all 2^ possible instantiations of the k cutset variables, each of which requires 
processing of a singly connected network. However, although fixed-parameter tractable, it is unlikely that 
the problem admits a polynomial kernel. 

Theorem 7. PosiTlVE-BN-lNFERENCE(loop cutset size) does not admit a polynomial kernel unless 

the Polynomial Hierarchy collapses. 

Proof. (Sketch.) We give a polynomial parameter transformation from SAT(vars) and apply Theorem 2. 
The reduction is based on the reduction from 3SAT given by Cooper (1990). However, we need to allow 
clauses with an arbitrary number of literals since, as observed above, 3S AT(vars) has a polynomial kernel. 
Let F be a CNF formula on n variables. We construct a BN (G, T) such that for a variable v we have 
Pr(w = true) > if and only if F is satisfiable. Cooper uses input nodes Ui for representing variables 
of F, clause nodes Ci for representing the clauses of F, and conjunction nodes di for representing the 
conjunction of the clauses. We proceed similarly, however, we cannot represent a clause of large size with 
a single clause node Cj, as the required table T^- would be of exponential size. Therefore we split clauses 
containing more than 3 literals into several clause nodes, as indicated in Figure 2. It remains to observe that 




Figure 2: BN representation of a clause on four literals. 

the set of input nodes E = {m , . . . , u„} is a loop cutset of the constructed BN, hence we have indeed a 
polynomial parameter transformation from SAT(vars) to PosiTIVE-BN-lNFERENCE(loop cutset size). 
The result follows by Theorem 2. □ 



8 Nonmonotonic Reasoning 

Logic programming with negation under the stable model semantics is a well-studied form of nonmono- 
tonic reasoning (Gelfond and Lifschitz, 1988; Marek and Truszczyiiski, 1999). A (normal) logic pro- 
gram P is a finite set of rules r of the form 

h < — ai A • • • A ttm A -ifti A • • • A -^bn 

where h, Ui, hi are atoms, where h forms the head and the a^, hi from the body of r. We write H{r) = h, 
(r) = {a\ , . . . , Oto}, and B~ (r) = {6i , . . . , 6„}. Let 7 be a finite set of atoms. The GF reduct of 
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a logic program P under / is the program obtained from P by removing all rules r with B~ {r)C\I ^ 0, 
and removing from the body of each remaining rule r' all Uterals -^b with fee /. / is a stable model of P 
if 7 is a minimal model of P^ , i.e., if (i) for each rule r G P^ with (r) C I we have H{r) € /, and (ii) 
there is no proper subset of I with this property. The undirected dependency graph U{P) of P is formed 
as follows. We take the atoms of P as vertices and add an edge x — y between two atoms x, y if there is 
a rule r € P with H{r) = x and y G B^(r), and we add a path x — u — y\f H{r) — x and y G (r) 
{u is a new vertex of degree 2). Thi feedback width of P is the size of a smallest set V of atoms such that 
every cycle of U (P) runs through an atom in V. 

A fundamental computational problems is Stable Model Existence (SME), which asks whether a 
given normal logic program has a stable model. The problem is well-known to be NP-complete (Marek 
and Truszczyriski, 1991). Gottlob, Scarcello, and Sideri (2002) showed that SME(feedback width) is 
fixed-parameter tractable (see Fichte and Szeider (2011) for generalizations). We show that this result 
cannot be strengthened with respect to a polynomial kernel. 

Theorem 8. SME(feedback width) does not admit a polynomial kernel unless the Polynomial Hierarchy 
collapses. 

Proof. (Sketch.) We give a polynomial parameter transformation from SAT(vars) to SME (feedback 
width) using a construction of Niemela (1999). Given a CNF formula F on n variables, we construct a 
logic program P as follows. For each variable a; of we take two atoms x and x and include the rules 
{x -^x) and [x ^ ^x); for each clause C of F we take an atom c and include for each positive literal 
a of C the rule (c a), and for each negative literal of C the rule (c ^ a); finally, we take two 
atoms s and / and include the rule {f ■'^ ~'f A ~'s) and for each clause C of F the rule (s c). is 
satisfiable if and only if P has a stable model (Niemela, 1999). It remains to observe that each cycle of 
U (P) runs through a vertex inV = {x,x : x E vars(F) }, hence the feedback width of P is at most 2n. 
Hence we have a polynomial parameter transformation from SAT(vars) to SME(feedback width). The 
result follows by Theorem 2. □ 

9 Conclusion 

We have established super-polynomial kernel lower bounds for a wide range of important AI problems, 
providing firm limitations for the power of polynomial-time preprocessing for these problems. We con- 
clude from these results that in contrast to many optimization problems (see Section 1), typical AI prob- 
lems do not admit polynomial kernels. Our results suggest the consideration of alternative approaches. 
For example, it might still be possible that some of the considered problems admit polynomially sized 
Turing kernels, i.e., a polynomial-time preprocessing to a Boolean combination of a polynomial number 
of polynomial kernels. In the area of optimization, parameterized problems are known that do not admit 
polynomial kernels but admit polynomial Turing kernels (Fernau et al., 2009). This suggests a theoretical 
and empirical study of Turing kernels for the AI problems considered. 
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